137 research outputs found

    Ontology of core data mining entities

    In this article, we present OntoDM-core, an ontology of core data mining entities. OntoDM-core defines the most essential data mining entities in a three-layered ontological structure comprising a specification, an implementation and an application layer. It provides a representational framework for describing the mining of structured data, and in addition provides taxonomies of datasets, data mining tasks, generalizations, data mining algorithms and constraints, based on the type of data. OntoDM-core is designed to support a wide range of applications and use cases, such as semantic annotation of data mining algorithms, datasets and results; annotation of QSAR studies in the context of drug discovery investigations; and disambiguation of terms in text mining. The ontology has been thoroughly assessed following established practices in ontology engineering, is fully interoperable with many domain resources and is easy to extend.

    S.cerevisiae Complex Function Prediction with Modular Multi-Relational Framework

    Proceeding of: 23rd International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2010, Córdoba, Spain, June 1-4, 2010. Determining the functions of genes is essential for understanding how metabolism works and for addressing its malfunctions. Genes usually work in groups rather than in isolation, so functions should be assigned to gene groups and not to individual genes. Moreover, genetic knowledge is highly relational and changes frequently. Thus, a propositional ad-hoc approach is not appropriate for the gene group function prediction domain. We propose the Modular Multi-Relational Framework (MMRF), which approaches the problem from a relational and flexible point of view. The MMRF consists of several modules covering all involved domain tasks (grouping, representing and learning using computational prediction techniques). A specific application is described, including a relational representation language, where each module of MMRF is individually instantiated and refined to obtain a prediction under specific given conditions. This research work has been supported by CICYT, TRA 2007-67374-C02-02 project and by the expert biological knowledge of the Structural Computational Biology Group at the Spanish National Cancer Research Centre (CNIO). The authors would like to thank members of the Tilde tool developer group at K.U.Leuven for providing their help and many useful suggestions.

    Multi-Target Prediction: A Unifying View on Problems and Methods

    Multi-target prediction (MTP) is concerned with the simultaneous prediction of multiple target variables of diverse types. Due to its enormous application potential, it has developed into an active and rapidly expanding research field that combines several subfields of machine learning, including multivariate regression, multi-label classification, multi-task learning, dyadic prediction, zero-shot learning, network inference, and matrix completion. In this paper, we present a unifying view on MTP problems and methods. First, we formally discuss commonalities and differences between existing MTP problems. To this end, we introduce a general framework that covers the above subfields as special cases. As a second contribution, we provide a structured overview of MTP methods. This is accomplished by identifying a number of key properties, which distinguish such methods and determine their suitability for different types of problems. Finally, we also discuss a few challenges for future research.

    Glycolysis Upregulation Is Neuroprotective As A Compensatory Mechanism In ALS

    Amyotrophic Lateral Sclerosis (ALS) is a fatal neurodegenerative disorder, with TDP-43 inclusions as a major pathological hallmark. Using a Drosophila model of TDP-43 proteinopathy, we found significant alterations in glucose metabolism, including increased pyruvate, suggesting that modulating glycolysis may be neuroprotective. Indeed, a high sugar diet improves locomotor and lifespan defects caused by TDP-43 proteinopathy in motor neurons or glia, but not muscle, suggesting that metabolic dysregulation occurs in the nervous system. Overexpressing human glucose transporter GLUT-3 in motor neurons mitigates TDP-43-dependent defects in synaptic vesicle recycling and improves locomotion. Furthermore, PFK mRNA, a key indicator of glycolysis, is upregulated in flies and patient-derived iPSC motor neurons with TDP-43 pathology. Surprisingly, PFK overexpression rescues TDP-43-induced locomotor deficits. These findings from multiple ALS models show that, mechanistically, glycolysis is upregulated in degenerating motor neurons as a compensatory mechanism and suggest that increased glucose availability is protective.

    Tight regulation of ubiquitin-mediated DNA damage response by USP3 preserves the functional integrity of hematopoietic stem cells

    Histone ubiquitination at DNA breaks is required for activation of the DNA damage response (DDR) and DNA repair. How the dynamic removal of this modification by deubiquitinating enzymes (DUBs) impacts genome maintenance in vivo is largely unknown. To address this question, we generated mice deficient for Ub-specific protease 3 (USP3; Usp3Δ/Δ), a histone H2A DUB which negatively regulates ubiquitin-dependent DDR signaling. Notably, USP3 deletion increased the levels of histone ubiquitination in adult tissues, reduced the hematopoietic stem cell (HSC) reserves over time, and shortened animal life span. Mechanistically, our data show that USP3 is important in HSC homeostasis, preserving HSC self-renewal and repopulation potential in vivo and proliferation in vitro. A defective DDR and unresolved spontaneous DNA damage contribute to cell cycle restriction of Usp3Δ/Δ HSCs. Beyond the hematopoietic system, Usp3Δ/Δ animals spontaneously developed tumors, and primary Usp3Δ/Δ cells failed to preserve chromosomal integrity. These findings broadly support the regulation of chromatin ubiquitination as a key pathway in preserving tissue function through modulation of the response to genotoxic stress.

    Predicting gene function using hierarchical multi-label decision tree ensembles

    Background: S. cerevisiae, A. thaliana and M. musculus are well-studied organisms in biology and the sequencing of their genomes was completed many years ago. It is still a challenge, however, to develop methods that assign biological functions to the ORFs in these genomes automatically. Different machine learning methods have been proposed to this end, but it remains unclear which method is to be preferred in terms of predictive performance, efficiency and usability. Results: We study the use of decision tree based models for predicting the multiple functions of ORFs. First, we describe an algorithm for learning hierarchical multi-label decision trees. These can simultaneously predict all the functions of an ORF, while respecting a given hierarchy of gene functions (such as FunCat or GO). We present new results obtained with this algorithm, showing that the trees found by it exhibit clearly better predictive performance than the trees found by previously described methods. Nevertheless, the predictive performance of individual trees is lower than that of some recently proposed statistical learning methods. We show that ensembles of such trees are more accurate than single trees and are competitive with state-of-the-art statistical learning and functional linkage methods. Moreover, the ensemble method is computationally efficient and easy to use. Conclusions: Our results suggest that decision tree based methods are a state-of-the-art, efficient and easy-to-use approach to ORF function prediction.